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Abstract 

We introduce a simple tool called the wavelet (or, p-wavelet) scheme. Wavelets deals with 
coordination among processes which are at most g hops away of each other. We present a self- 
stabilizing solution for this scheme. Our solution requires no underlying structure and works in 
arbritrary anonymous networks, i.e., no process identifier is required. Moreover, our solution 
works under any (even unfair) daemon. 

Next, we use the wavelet scheme to design self-stabilizing layer clocks. We show that they 
provide an efficient device in the design of local coordination problems at distance g, i.e., g-barrier 
synchronization and p-local resource allocation (LRA) such as g-local mutual exclusion (LME), 
g-group mutual exclusion (GME), and g- Reader/ Writers. Some solutions to the g-LRA problem 
(e.g., g-LME) also provide transformers to transform algorithms written assuming any g-central 
daemon into algorithms working with any distributed daemon. 

Keywords: Barrier Synchronization, Local Synchronization, Resource Allocation, Self-Stabili- 
zation, Unison. 

1 Introduction 

Most of the distributed system are not fully connected networks. Each process is only directly con- 
nected with a subset of others, called neighbors. By this way, the communication links organize 
the network in a some graph topology which is either arbitrary or in accordance with some global 
topology constraints, e.g., acyclicity, constant degree, ring, grid, etc. Whatever the topology com- 
plexity, the design of a distributed task is simplified if it requires coordination mechanisms involving 
a process with its neighbors only, i.e., one hop away. Such distributed tasks are said to be local. 
Unfortunately, many distributed tasks requires coordination farther away than the immediate neigh- 
bors, i.e., q hops away with g > 1. If g is equal to the diameter of the network D, then the task is 
said to be global. 

In this paper, we consider problems requiring coordination among processes which are at most 
g hops away of each other. We present solutions having the desirable property of self-stabilization. 
The concept of self-stabilization [Dij74, DolOO] is an efficient approach to design distributed systems 
to tolerate arbitrary transient faults. A self-stabilizing system, regardless of the initial states of the 
processors and initial messages in the links, is guaranteed to converge to the intended behavior in 
finite time. 



Motivation and Related Works. Coordination at distance g received a particular attention in 
recent works. There are various motivations for this issue. The first one consists in the design of 
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£>-local computations [NS93], i.e., running in constant time independent of any global parameter like 
the size of the network or the diameter. Computation in constant time g can be achieved if the 
processes can collect informations from processes located within radius of g from them. In [NS93], 
the authors mainly address Local Checkable Labeling problems. Local computation is also considered 
in [GMM04] by considering the recognition problem. The computing model is a relabelling system. 

Wireless networks bring new trends in distributed systems which also motivate research the 
local control of concurrency at distance g. In [DNT06], the authors propose a generalization of the 
well-known dining philosophers problem [Dij68]. They extends the conflict processes beyond the 
immediate neighbors of the processes. As an application, their solution provide a solution to the 
interfering transmitter problem in wireless networks. 

Another motivation consists in assuming that the knowledge of the processes goes beyond their 
immediate neighbors could help in the design of non-trivial tasks [GGH+04, GHJT06]. An efficient 
self-stabilizing solution is given to the maximal 2-packing problem assuming the knowledge at dis- 
tance 2 [GGH + 04]. (The maximal 2-packing problem consists to find a maximal set of nodes S, such 
that no two nodes in S are adjacent and no two nodes in S have a common neighbor.) The solution 
in [GGH+04] requires process ID's and works under a central daemon. In [GHJT06], the authors 
propose a ^-distance knowledge transformer to construct self-stabilizing algorithms which use a g 
distance knowledge. Again, their solution works only if the daemon is central and with process ID's. 

Note that various kinds of transformers have been proposed in the area of self-stabilization to 
refine self-stabilizing algorithms which use tight scheduling constraints like the central daemon into 
the corresponding self-stabilizing algorithm working assuming weaker daemons, e. g., [MN98, GH99, 
NA02, CDP03]. A popular technique consists in composing the algorithm with a self-stabilizing local 
mutual exclusion (LME) algorithm [MN98, GH99, NA02]. LME allows to grant privileged processes 
to enter critical section if and only if none of their neighbors has the privilege, this infinitely often. So, 
Any LME-based solution does not allow concurrent execution of neighboring processes. The solution 
in [CDP03] is based on the Local Resource Allocation (LRA), which allows neighboring processes to 
enter their critical sections concurrently provided they do not use conflicting resources. It transforms 
any algorithms written in a high-atomicity model {e.g., with a central daemon) into the distributed 
read/write atomicity model by allowing neighborhood concurrency. 

However, none of the above solutions allows coordination farther than the immediate neighbors. 
So, they are not directly applicable to the method developed in [NS93, GGH+04, GHJT06, DNT06]. 

Contributions. In this paper, we introduce a simple tool called the wavelet (or, g-wavelet) scheme. 
Wavelets deals with coordination among processes which are at most g hops away of each other. 
Wavelets are related to the notion of wave (also called Total algorithm [Tel88, Tel04]). 

In this paper, we present a self-stabilizing solution to the £>-wavelet problem. There are several 
way to design the wavelet scheme depending on network properties. For instance, assuming a unique 
identifier on each process, in [DNT06], the authors provides a self-stabilizing £- wavelets scheme 
by combining a stabilizing Propagation of Information with Feedback (PIF) [BDPV99] over a self- 
stabilizing BFS spanning tree [HC92, Joh97] rooted at each process of height equal to g. 

By contrast, our solution requires no underlying structure and works in arbritrary anonymous 
networks, i.e., no process identifier is required. Our solutions is based on the unison in [BPV04] and 
works assuming any distributed (even unfair) daemon. 

Next, we use the wavelet scheme to design self-stabilizing layer clocks. The lower layer clock, in 
the sequel called the main clock, provides a wavelet stream. The upper layer clock, so called the 
slave clock, achieves a g— barrier synchronization mechanism, where no process p starts to execute 
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its phase i+1 before all processes in the g— ball centered in p have completed their phase i. 

Finally, we show that the layer clock also provides an efficient underlying device in the design 
of various local resource allocation problems at distance g. This problems include Mutual Exclu- 
sion [Dij65], Group Mutual Exclusion [JouOO], and Readers- Writers [CHP71]. Some of these solutions 
(e.g., £-LME) also provides transformers to transform algorithms written assuming any ^-central 
daemon into algorithms working with any distributed daemon. 

Paper Outline. The remainder of the paper is organized as follows. We formally describe nota- 
tions, definitions, and the execution model in Section 2. We also state what it means for a protocol to 
be self-stabilizing. In Section 3, we define the wavelet scheme, present our solution for this problem 
in an arbitrary anonymous networks, and show how it can be used as an infimum computation at 
distance g. In Section 4, we introduce the self-stabilizing layer clocks and show how they can be used 
to solve g- local coordination problems. Finally, we make some concluding remarks in Section 5. 

2 Preliminaries 

In this section, we first define the model of distributed systems considered in this paper. We then 
define the execution model and various general definitions such as events, causal DAG, and Coherent 
Cuts. We also state what it means for a protocol to be self-stabilizing. 

2.1 Distributed System 

A distributed system is an undirected connected graph, G = (V, E), where V is a set of nodes — \V\ = 
n, n > 2 — and E is the set of edges. Nodes represent processes, and edges represent bidirectional 
communication links. A communication link (p, q) exists iff p and q are neighbors. The distributed 
system is considered to be arbitrary and anonymous, i.e., we consider no particular topology nor 
unique identifiers on processes. 

The set of neighbors of every process p is denoted as M p . The degree of p is the number of 
neighbors of p, i.e., equal to \N P \. The distance between two processes p and q, denoted by d (p, q), 
is the length of the shortest path between p and q. Let g be a positive integer. Define V(p, g) as the 
set of processes such that d(p,q) < g, i.e., the g— ball centered at p. D denote the diameter of the 
network. 

The program of a process consists of a set of registers (also referred to as variables) and a finite set 
of guarded actions of the following form: < label >:: < guard > — >< statement >. Each process 
can only write to its own registers, and read its own registers and registers owned by the neighboring 
processes. The guard of an action in the program of p is a boolean expression involving the registers 
of p and its neighbors. The statement of an action of p updates one or more registers of p. An action 
can be executed only if its guard evaluates to true. The actions are atomically executed, meaning 
the evaluation of a guard and the execution of the corresponding statement of an action, if executed, 
are done in one atomic step. 

2.2 Execution Model 

The state of a process is defined by the values of its registers. The configuration of a system is the 
product of the states of all processes. Let a distributed protocol V be a collection of binary transition 
relations denoted by i— on T, the set of all possible configurations of the system. V describes an 
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oriented graph S = (T, \— >), called the transition graph of V . A sequence e = 70, 71, . . . , 7«, 7«+i, ... is 
called an execution of iff Vi > 0, 7, 1— > 7^+1 G S. A process p is said to be enabled in a configuration 
7i (7i £ r) if there exists an action ^4 such that the guard of A is true in 7$. Similarly, an action A 
is said to be enabled (in 7) at p if the guard of A is true at p (in 7) . We consider that any enabled 
processor p is neutralized in the computation step 7$ 1— > 7^+1 if p is enabled in 7, and not enabled 
in 7i+i, but does not execute any action between these two configurations. (The neutralization of a 
processor represents the following situation: At least one neighbor of p changes its state between 7, 
and 7j+i, and this change effectively made the guard of all actions of p false.) 

We assume an unfair and asynchronous distributed daemon. Unfairness means that even if a 
processor p is continuously enabled, then p may never be chosen by the daemon unless p is the only 
enabled processor. The asynchronous distributed daemon implies that during a computation step, if 
one or more processors are enabled, then the daemon chooses at least one (possibly more) of these 
enabled processors to execute an action. 

In order to compute the time complexity, we use the definition of round [DIM97]. This definition 
captures the execution rate of the slowest processor in any computation. Given an execution e, the 
first round of e (let us call it e') is the minimal prefix of e containing the execution of one action of 
the protocol or the neutralization of every enabled processor from the first configuration. Let e" be 
the suffix of e, i.e., e = e'e" . Then second round of e is the first round of e", and so on. 

2.3 Events, Causal DAG's and Cuts 

Definition 2.1 (Events) Let 7071.... be a finite or infinite execution. For all p G V, (p,0) is an 
event. Let 74 — > 74+1 be a transition. If the process p executes a guarded action during this transition, 
we say that p executes an action at time t + 1. The pair (p,t + 1) is said to be an event (or a p- 
event). Events so that the guard does not depend on the shared registers of any neighbor are said to 
be internal. 

Definition 2.2 (Causal DAG) The causal DAG associated is the smallest relation ~> on the set 
of events such that the following two conditions hold: 

1. Let (p,t) and (p,t') be two events such that t > to, t' is the greatest integer such that to < 
t' < t. Then, (p,t') ~» (p,t); 

2. Let (p,t) and (q,t') be two events such that (p,t) is not an internal event, q £ N p , t > to, 
and t' is the greatest integer such that to < t' < t. Then, (q,t ! ) ~» (p,t). 

Denote the causal order on the sequence 7071.... by ^. Relation ^ is the reflexive and transitive 
closure of the causal relation ~>. The past cone of an event (p, t) is the causal-DAG induced by every 
event (q,f) such that (q,f) ^ (p, t). A past cone involves a process q iff there is a g-event in the 
cone. We say that a past cone covers V, iff every process q £ V is involved in the cone. The cover 
of an event (p,t), denoted by Cover(p,t), is the set of processes q covered by the past cone of (p,t). 

Definition 2.3 (Cut) A cut C on a causal DAG is a map from V to N, which associates a process 
p with a time tp. We mix this map with its graph: C = {(p, tp) ,p € V} . 

The past of C, denoted by ]*— , C], is the set of events (p, t) such that t <tp. Similarly, we define 
the future of C, denoted by [C, — >[, as the set of events (p,t) such that tp < t. A cut is said to be 
coherent if (q,t') ^ (p, t) and (p,t) ^ (p, tp), then (q,t') ^ (q,tq) ■ A cut C\ is less than or equal to 
a cut C2 , denoted by C\ < C2 , if the past of C\ is included in the past of C2 ■ 
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If C\ and C2 are coherent cutes such that C\ -< C2, then [Ci,C2] is the induced causal DAG 
defined by the events (p, t) such that (p, i^ 1 ) < (p,t) ^ (p, i^ 2 ).A sequence of events is any segment 
[CijCy where C\ and C2 are coherent cuts satisfying C\ < C2. Any event of Ci is called an initial 
event. 

2.4 Self-Stabilization 

Let X be a set. A predicate P is a function that has a Boolean value — true or false — for each 
element x £ X . A predicate P is closed for a transition graph 5 iff every state of an execution e 
that starts in a state satisfying P also satisfies P. A predicate Q is an attractor of the predicate P, 
denoted by P t> Q, iff Q is closed for S and for every execution e of S 1 , beginning by a state satisfying 
P, there exists a configuration of e for which Q is true. A transition graph S is self-stabilizing for a 
predicate P iff P is an attractor of the predicate true, i.e., true D> P. 

3 Wavelets 

In this section, we first define the problem considered in this paper, followed by our self-stabilizing 
solution designed for any anonymous networks. Next, we show that it provides an efficient tool to 
compute any infimum at distance g. 

3.1 Problem Definition 

Let us assume that there exists a special internal type of events called a decide event. Let g be an 
integer. A g-wavelet is a sequence of events [Ci,C2] that satisfies the following two requirements: 

1. The causal DAG induced by [61,62] contains at least one decide event; 

2. For each decide event (p, t), the past of (p, t) in [61,62] covers V(p,k). 

A wave is the particular case where g > D, D is the diameter of the network. There are several 
way to implement the £>- wavelet scheme if the processes have Id's, for instance using the PIF scheme 
on trees with height equal to g rooted at each process. In the following subsection, we present a 
solution for the £- wavelet problem in anonymous networks. Next, we show how this solution provides 
a self-stabilizing infimum computation in a £-ball. 

3.2 Solution Description 

Our solution is based on the unison developed in [BPV04], which stabilizes in O(n) rounds in general 
graphs. Note that in a tree, we could use the protocol proposed in [BPV06]. It gives the better sta- 
bilization time complexity of at most D rounds. In the sequel, we first borrow some basic definitions 
and properties introduced in [BPV04], followed by our solution and its correctness proof. 

3.2.1 Unison 

Basic Definitions and Properties. Let Z be the set of integers and K be a strictly positive 
integer. Two integers a and b are said to be congruent modulo K, denoted by a = b[K] if and 
only if 3A <G Z, b = a + XK. We denote a the unique element in [0,K — 1] such that a = a[K\. 
min(a — b,b — a) is a distance on the torus [0, K — 1] denoted by dx(a, b) . Two integers a and b are 
said to be locally comparable if and only if dxia, b) < 1. We then define the local order relationship 
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<i as follows: o<;6^0<ii-a<l. If a and b are two locally comparable integers, we define 

def 

b Q a as follows: b Q a = if a </ 6 then b — a else — a — b. If ao, cti, a>2, ■ ■ ■ o, p -i,a p is a sequence of 

p-i 

integers such that Vi G {0, ... ,p — 1}, a, is locally comparable to Oj+i, then S 1 = ^ (oi+i © «i) is 

i=0 

the Zoca/ variation of this sequence. 

Define X = {—a, . . . , 0, . . . , K — 1}, where a is a positive integer. Let (p be the function from X to 

X defined by: tp(x) = if x > then x + 1 else x + 1. The pair (X, ip) is called a finite incrementing 
system. K is called the period of (X, tp). Let tail^ = {—a, . . . , 0} and ring^ = {0, . . . , K — 1} be the 
sets of "extra" values and "expected" values, respectively. The set tail* is equal to tail^ \ {0}. A 
reset on consists in enforcing any value of X to —a. We assume that each process p maintains a 
clock register p.r using an incrementing system (X,ip). Let 7 be a system configuration, we define 

dcf 

the predicate WU as follows: WU(j) = Vp G V,Vg 6 M p : (p.r £ ring^) A (|p.r — g.r| < 1) in 7. In 
the remainder, we will abuse notation, referring to the corresponding set of configurations simply by 
WU. 

In WU, the clock values of neighboring processes are locally comparable. In the sequel of the 
paper, we need the three following definitions: 

Delay. The delay of a path ji = pop\ . . - Pk, denoted by 5^, is the local variation of the sequence 

fe-i 

p .r,pi.r, . . . ,pk-r, i.e., = Yl (pi+i-r QiPi-r) if k > 0, otherwise (A; = 0). 

i=0 

Intrinsic Delay. The delay between two processes p and q is intrinsic if it is independent on the 
choice of the path from p to q. The delay is intrinsic iff it is intrinsic for every p and q in V . 
In this case, and at time t, the intrinsic delay between p and q is denoted by <5(p,q). 

WUo- The predicate WUq is true for a system configuration 7 iff 7 satisfies WU and the delay is 
intrinsic in 7. 

Unison Definition. Assume that each process p maintains a register p.r G x- The self-stabilizing 
asynchronous unison problem, or simply the unison problem, consists in the design of a protocol so 
that the following properties are true in every execution [BPV05]: 

Safety : WU is closed. 

Synchronization: In WU, a process can increment its clock p.r only if the value of p.r is lower 
than or equal to the clock value of all its neighbors. 

No Lockout (Liveness): In WU, every process p increments its clock p.r infinitely often. 

Convergence: T > WU. 

The following guarded action solves the synchronization property and the safety: 

Vq £ M p : (q.r = p.r) V (q.r = (p(p.r)) — ► p.r := <~p(p.r) 

The predicate WUq is closed for any execution of this guarded action. Moreover, for any execution 
starting from a configuration in WUq, the no lockout property is guaranteed. Generally, this property 
is not guaranteed in WU. 
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3.2.2 Protocol 

Variable and algorithm description. The protocol is shown in Algorithm 1. For each process 
p, let V(p, g) be the set of processes which are cooperating (or conflicting) with p. Each process 
q € V(p, g) is at most fc-hops away from p — d(p, q) < g. Let (x, V?) be an incrementing system, such 
that x = {-a, 0, 1, gK - 1}. In [BPV04], it is shown that: 

1. a greater than or equal to Tq ensures the convergence property of the unison, where Tq is the 
size of the greatest hole of G, i.e., the length of the longest chordless cycle of G if G contains 
cycle, 2 otherwise (G is acyclic); 

2. gK greater than Cq ensures the liveness property of the unison in WUq, where Cq is the 
cyclomatic characteristic of G, i.e., the smallest length of the longest cycle in the set of all the 
cycle basis of G. 

Note that Tq is upper bounded by n and Cq is upper bounded by min(n, 2D). We assume that the 
above two conditions are satisfied. 



Algorithm 1 (SS - WS) The Self-Stabilizing Wave Stream for p 
Constant and variable: 

Af p : the set of neighbors of process p; p.r G x; 

Boolean Functions: 

ConvergenceStepp = 
Locally Correctp = 
NormalStepp = 
Resetlnitp = 
Actions: 

NA : NormalStepp 



p.r G tail^ A (Vq G M v : (q.r G tail v ) A (p.r <tail v q.r)); 
p.r G ring^ A (Vq G Af P , q.r G ring<p A ((p.r = q.r) V (p.r 
p.r G ring v A (Vq G AT P : (p.r = q.r) V (q.r = ip(p.r)))\ 
-iLocally Correctp A (p.r tail^); 



if (q.r)) V (if (p.r) = q.r))); 



CA : ConvergenceStepp 
RA : Resetlnitp 



« CS 1 » ; 
if p.r = g- l[g] then << CS 2 » ; 
p.r := ip(p.r); 
p.r := (p(p.r); 
p.r := q (reset); 



Analysis of Algorithm 1 in WUq. By definition of WUq, the delay is intrinsic — refer to Sub- 
section 3.2.1. It defines a total preordering on the processes in V, so called precedence relationship. 
given a configuration in WUq, the absolute value of the delay between two processes p and q is equal 
to or less than the distance d(p, q) in the network. 

We will now prove that Algorithm 1 provides a ^-wavelet scheme. We will develop a proof 
technique called lifting. The idea behind this term is to interpret any possible configuration in WUq 
by another such that the register values are in N, the set of the positive integers. In this way, the 
precedence relationship becomes the natural order on N. It is possible because delay is intrinsic. 

Consider 7071... be a maximal execution starting in WUo. Let po be a minimal process, according 
to the precedence relationm. 70. Let _l_o = Po- r at time 0. Denote the value of a register r of a process 
p in the state by p l .r. Similarly, <5L ^ denotes the delay between p and q in 7 t . 

For each process p G V, we associate the virtual register p~7r. For the state 70, we initiate this 
virtual register by the instruction px := _l_o + ^ PoP )- During the execution, for each transition 
7t l— > 7t+i the instruction p?r := p7r + 1 holds if and only if p.r := p.r + 1 holds during the same 
transition. Denote by t p ^ the smallest time such that p7r := k. Since the delay is bounded by D, 



7 



if k > J-o + D, then t Pj k is well denned and the cut Ck = {{p,t Pi k),P G V} is well defined on the 
network. 

We now need to prove that for every k > _l_o + D, the cuts Ck are coherent. We first claim the 
following obvious lemma: 



From the Lemma 3.1, if (q,t) ^ (p,t Pi k) then (q,t) ^ (g, t q ^)- It follows: 
Lemma 3.2 For every k > _l_o + D the cut Ck is coherent. 

Lemma 3.3 Let k > _l_o + D. If (p,t) is an event in the interval [Cfc,— >[, then V(p,p t .r — k) C 
Cover(p, t). 

Proof. The statement holds for the initial events of [Ck, — >[. Let A be the set of events (p, t) 
in [Cfc, — ►[ such that V(p,p t .r — k) C Cover(p,t) does not hold. We assume that A is not empty, 
let (g, t) a minimal event in .4 according to ^. Let g = q T .r — k, and let p\ G V(g, g). If p± = q 
then pi G Cover(q,r), otherwise there exists q± G J\f q such that pi G ^(^l, — 1)- is not 

an initial event, so q\ G jV^ and there exists T qi such that (qi,T qi ) ~> (q,r). By the minimality of 
(g, r), V(gi,^— 1) C Cover(qi,T qi ) holds. So, V((7i,£>— 1) C Cover(q,r). Thus, pi G Cover(q,r). 
Therefore, (g, r) is not in A Thus A =0, and the lemma is proved. □ 

As a corollary of Lemma 3.3, the following result holds : 

Theorem 3.4 Let k > _l_o + D and g be a positive integer. Then, [Ck, Ck+ g ], with C^+q as the set 
of decide events, is a g-wavelet. 

3.3 Infimum Computation 

Problem definition. In [Tel88, Tel04], the author introduces the infimum operators. An infimum 
© over a set S, is an associative, commutative and idempotent (i.e. x © x = x) binary operator. 
If P = {ai, a2, a r } is a finite part of (S) and oq G (S) then, from the associativity, ©P means 
a\ © a2 © ... © a r . So, ao © P = ao © a\ © 02 © ... © a r . Such an operator defines a partial order 
relation <^ over S, by x <© y if and only if x © y = x. We assume that § has a greatest element 
e©, such that x <e e ffi for every x G S. Hence (§,©) is an Abelian idempotent semi-group with e© 
as identity element for ©. 

Theorem 3.5 ([Tel88, Tel04]) A wave can be used to compute an infimum. 

Self-Stabilizing Infimum Computation in a £-ball. In order to add a initializing step, we 
assume 5 = g + 1. We consider the following problem: at time Cjjs eacn register p.vo is initialized 
during the critical section << CR2 >>, precisely when the register p.r takes the value UCS. At the 
end of each phase <&u = [Cus, Cus+s-i] , each process p needs to known the infimum of the registers 
q.VQ of every q in V (p, g). 

To reach the objective, we define for each process p two added registers p.v\ and p.V2- These two 
registers are initialized at the date Cjjs during the critical section << CR2 >>, by the value p.vo- 




G{g*°.r,...,g*°.r + z} 
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For k G {1, 2, g}, at the date Cus+k, the action << CS1 >> is defined by: 
p.v\ := p.v 2 ; P-v 2 ■= P-Vq {Q-Vui(q), Q G with, if q.r = p.r then uj (q) = 2, and if q.r = p.r + 1 

then uj (q) = 1. 

Proposition 3.6 For p € V and G {1, at the date Cus+k, both equalities hold: 
(1) p. Vl = {q.v , q£V(p,k- 1)}, and (2) p.v 2 = {g.u , <j£^(p, fc)}. 

Proof. 

At Cj/,5, any process p satisfies p.t>i = p.^o and p.f2 = p.t»o, it is the initializing step. Let A the 
set of events in <&u = [Cus+i, Cus+g], for which the proposition is not true. 

Assume by contradiction that A is not empty. Let (p, t) a minimal event in A. Let k G {1, 2, £>} 
such that (p, t) G Cus+k- There exists to such that (p, to) £ ^(7 and (p, to) (p, t). We have p*.«i = 
p to .V2 = and p* .W2 = {q-vo, q G V (p,k — 1)}. This equality is true even if k = 1. Now, p*.^ = 
P- w o •jV 9 -'*V<j)) 9 G A/" p }. From the minimality of the event (p, t), the events (g, t ? ), where t q < t, 
are not in A and are in [C Ue , Cug+g-i]- So, p.t> {q tq - v <p{q), Q e -^p} = {Q- v o, 9 ^ V (p, fe)}. We 
obtain a contradiction. □ 

As a corollary, we obtain the expected theorem: 

Theorem 3.7 On the cut Cus+ e , P-V2 contains the infimum of the registers q.vo in the g—ball 
centered in p, according to the phase U = [Cus, Cus+g]- 

4 Applications 

In this part, we show how to synchronize a self-stabilizing layer clock. The main clock defines a 
wavelet stream. Using the wavelet stream, we design with the slave clock a barrier synchronization 
at distance p. We then show how to use this layer clock to tackle efficiently many local synchronization 
problems at distance p. 

4.1 Self-stabilizing Layer Clock 

The idea is to manage the £— wavelet stream. A clock organizes this stream. The wavelets are used to 
compute concurrently local infimum on each £-ball. For each process p, once the infimum computed, 
a second clock defines a delay notion on the network. This delay is a total preordering useful to 
schedule the critical section enter of each process. 

More formally, we define two clocks, the first clock C\ (the master clock) and a second clock C2 
(the slave clock). The incrementing systems are respectively (xi)Vi) an d (X2,<P2)- The behavior of 
the slave clock is scheduled by the first clock and a predicate cond. The predicate cond depends 
of the problem solved. To distinguish the two clocks, all the registers are subscripted by 1 or 2 
respectively for the master clock and the slave clock. The predicates are superscripted by 1 or 2. 
For instance, the register of the master clock is denoted by r\ and the register of the slave clock is 
denoted by and the predicate NormalStepp is defined on the register r\ of the process p. The 
predicate NormalStepp is defined on the register r 2 of the process p. We define in the same way 
WU U WU 2 , and WU = WU X n WU 2 . The stabilization of the layer clock is to guarantee T t> WU. 

When the system is stabilized, the schedule of the slave clock of a process p is defined in << 
CS2 » by 

NormalStepp A cond — > << CS2 >>; p.r2 := tp 2 (p.r 2 ) 
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Predicate cond is independent of the register p.r\. It is this predicate which expresses the distance g 
synchronization problem solved, while the procedures Initialization and Computation are scheduled 
by the wavelet and provide a preprocessing used by cond. The procedures Initialization and Compu- 
tation depends on the problem to be solved to the layer clock. We give some instances for different 
problems later. 

We define the predicate NormalStep p = NormalStepp A Locally C or rectp. 



Algorithm 2 (SS - DC) Self-stabilizing Layer Clock 

Constant and variable: 

J\f p : the set of neighbors of the process p; 
P-ri G Xi; P-r2 G X2; 
Boolean Functions: 
For clock i G {1,2}: 

ConvergenceStepl = p.n G tail* Vi A (\/q G N P : (q.ri G tail Vi ) A (p.n <taa v . q.ri)); 
Locally Correctp = p.n G ring Vi A 

Vq G N p ,q.r G ring lflt A ((p.n = q.n) V (p.n = <£i (q.ri)) V (fi (p.ri) = q.ri)); 
NormalStepp = p.n G ring H>i A (Vq G M v : (p.ri = q.ri) V (q.ri = <p(p.n))); 

Resetlnit'p = -iLocallyCorrectp A (p.n & tail Vi ); 

Common predicate: 

NormalStepp = NormalStepp A Locally C or rect^; 
Actions: 

N A : NormalStepp — > if p.n = g — l[g] then 

Begin 

NormalStepp A cond — > << CS 2 >> ; 

if condi then p.r2 := f2 (p.r2) 

Initialization ; 
End 

else Computation; 
p.r x := ip 1 (p.r 1 ); 

For clock i G {1,2}: 

CAi : ConvergenceStepp — > p.ri := ifii(p.n); 
RAi : Resetlnit'p — > p.ri := cti (reset); 



Due to the lack of space, the proofs of Proposition 4.1 and 4.2 are left in the appendix. 
Proposition 4.1 The layer clock stabilizes to WU. 

Proposition 4.2 (No starvation) Once stabilized, the clock Ci increments infinitely often. 
4.2 Local Comparison in a £>-ball 

In the network, when the layer clocks are stabilized, the delay between two processes according to 
the slave clocks defines a total preordering on the processes. Unfortunately this delay is a global 
notion. The problem is to find a condition such that for two processes in the same £>-ball, it is 
possible to calculate directly the delay with only the knowledge of the r^ register values, and so to 
decide which process precedes the other according to the delay. To organize comparison between 
two processes lying in a same ball of radius equal to g, it is sufficient to be able to compare the 
slave clock registers of any two processes at distance less than or equal to 2g. By this way, in each 
fj-ball B, we will be able to define a total preordering among the processes in B by comparison of the 
values of the registers r<i of the processes. Of course we want that this total preordering is the same 
than the preordering defined by the delay. In order to reach this objective, we extend the locally 
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comparability defined at one hop (refer to Section sub:unison) to the distance 2g. For the clarity, we 
must be more formal: A local order on a set x is an antisymmetric and reflexive binary relation on 
X- Let x = {0,...,K — 1} such that K > 4g + 1. Let a and b be two elements of x- Let us assume 
that dx (a, b) < 2g. Let us define now a local ordering <i by : a <i b ^def < b — a < 2g. 

Lemma 4.3 Let p and q be two processes satisfying d (p, q) <2g . If a = p.ri and b = q.T2 then the 
delay g p>q is equal to b — a if < b — a < 2g and is equal to —a — b otherwise. 

Proof. Since d(p,q) < 2g, we have g p<q G {— 2g, 2g}. Moreover, g p<q = b — a[K]. Since, 
K > 4g, then g Ptq is equal to b — a if < b — a < 2g, —a — b otherwise. □ 

From this lemma, we access to the delay in each g— ball B. So our problem is solved. In the follow- 
ing section, we assume that g = g + 1, K\ = gK with K\ > Cg — 1 and K2 > max (4g + 1, Cg — 1)- 
These assumptions ensure that the layer clocks are self-stabilizing, that the main clock is calibrate 
to defined a £-wave stream , and that delay defined by the slave clocks is computable at distance 2g 
with the only knowledge of the slave clock registers r2- 

4.3 £>-Local Resource Allocation 
4.3.1 Problem Definitions 

The Resource Allocation problem deals with resource sharing problems among the processes. The 
resource allocation allows processes to access resources, (i.e., their critical sections) concurrently, 
provided the resources are not conflicting with each other. 

Definition 4.4 (Graph of Compatibility [CDP03]) Let A be a set - sometime named the re- 
source set-, let R be a reflexive binary relationship on A. We say that R is the compatibility rela- 
tionship on A. If (a, b) € R, then we say that a and b are compatible. If (a, b) £ R then we say that 
a and b are conflicting. 

The specification of the general resource allocation problem is defined as follows: 

Safety: if a processor requests a resource in A to enter in critical section, then its request is eventually 

satisfied and it enters the critical section. 
Fairness: In every execution, if two processes execute their critical section simultaneously, then 

both are using resources whose are compatible. 

Most of the problem requiring coordination among process sharing some resources are particular 
instances of the graph of compatibility, and then, particular safety requirements. For instance, the 
following well-known problems are particular instances of the resource allocation problem: 

Mutual exclusion: A is the set of processes and R = {(a, a) ,a G A} . The safety condition is: In 

every execution, no two processes execute their critical section simultaneously. 

Readers- Writer: A is the set of 2-uples {(p, r) ,p £ V,r G {read, write}}, and R is defined by the 
safety condition: In every execution, if two processes execute their critical section simultane- 
ously, then both are executing a read operation. 

Group Mutual Exclusion: R is a equivalence relationship over a set of resources A. R is de- 
fined by the safety condition: In every execution, if two processes execute their critical section 
simultaneously, then both are using resources in the same equivalence class. 
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We now generalize the above requirements by limiting their effect to the conflict processes which 
are at £-hops away of any given process p. Obviously, if g = D, then the problem comes down to 
the above requirements. If g = 1, then the set of processes which are conflicting with a process 
p is reduced to the neighboring processes of p. The most popular of this problems is the dining 
philosopher problem, also called the Local Mutual Exclusion (LME) problem. 

^-Local mutual exclusion: A is the set of processes and R = {(a, b) G A 2 , d(a,b) > g or a = 6} 

^-Local Readers- Writer: A is the set of 2-uples {(p, r) ,p G V, r G {read, write}}, and R is defined 

by the safety condition: In every execution, if two g-neighboring processes execute their critical 
section simultaneously, then both are executing a read operation. 
^-Local Group Mutual Exclusion: R is a equivalence relationship over a set of resources A . R 
is defined by the safety condition: In every execution, if two g-neighboring processes execute 
their critical section simultaneously, then both are using resources in the same equivalence class. 

The g generic version called the ^-Local Resource Allocation (^-LRA) is specified as follows: 
R is a general relationship over a set of resources A. The safety condition is: In every execution, if 
two g-neighboring processes execute their critical section simultaneously, then both are using resources 
whose are compatible. 

4.3.2 Self-Stabilizing Solutions 

Due to the lack of space, the correctness proofs of this section are left in the appendix. 

£-LRA. Each process p maintains three registers p.v : S where £ is any data type, and the registers 
p.resi : \2 and p.res2 ■ X2 x S. The content of p.v is the asked resource. Let us assume that a 
total order ^ is defined on S. We reach the two fields of the register p.resi by p.resi.r and p.resi.v 
respectively, with i G {1, 2}. We define a g— local ordering on \2 x S by: 

(r, v) < (r 1 , v') -£4> (r <; r') or (r = r' and v ■< v') 

Recall that <; is defined by the delay, which is a total preordering. This preordering being computable 
at distance g. Define the associated £-local infimum in the following way: 

(r,v) © (r',v r ) = if (r,v) < (r',v r ) then (r,v) else (r',v') 

Define the macros Initialization and Computation, respectively by: 

Initialization = p.res\ := (p.r2,p.v) ; p.resi ■= (p.V2,p.v) 

Computation = p.res\ := p.res^, p.res2 ■= (p.T2,p.v) {q.res^^, q G M p } 

with, if q.r2 = p.r2 then u (q) = 2 and if q.r2 = p.r2 + 1 then uj (q) = 1. The preprocessing of a local 
infimum designates a winner q in the g— ball centered in p. It is important to see that this process 
is elected by p, and perhaps it is not elected by all the processes in the g— ball centered in q. The 
condition cond depends of the solved problem, it is the disjunction: (p.r2,p.v) = p.res2 or cond2- 
Where (p.r2,p.v) = p.res2 means that p is elected in the g— ball centered in p, the condition cond2 
is there to raise concurrency, it depends of the solved problem. If p is not elected by the g— local 
infimum calculation and cond2 is true, the incrementation of the register may to be not wanted. The 
condition cond\ of the incrementation is: p.r2 = p.res2-r. 
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£-LME. Assume that each process p has an identifier denoted by p.id. The value of p. id is in 
a total ordered set S, for instance the integers. In fact, the identities must be only a 2 ^-distance 
network coloring. That is to say that every node must be colored such that two vertices lying at 
distance less than or equal to 2g have not the same color. We define for each process p the register 
p.v as p.id. The couple (p.r2,p.id) is defined for each process p. The condition cond2 is defined by 
false and thus cond\ = true. 

Definition 4.5 g-LME has a fairness index ofk, if in any computation, between any two consecutive 
critical section execution of a process, any other process can execute its critical section at most k 
times. The time-service of g-LME is the maximal number of critical executions by other processes 
between two successive executions of the critical section by any process. 

Our g-LME algorithm is a barrier synchronization at distance g. We deduce: 

and the service time 



Proposition 4.6 For the g-LME algorithm, the fairness index is equal to 
is upper bounded by 



n(n— 1) 
Q 



From the definition of a phase [BPV04], and because K\ is in O {gD) C O (D 2 ), K-i is in O (D), 
respectively. So, K\Ki is in O (D 3 ). Thus: 

Proposition 4.7 (1) During one phase, the number of link- communications is equal to 2 (g + 1) \E\, 
where \E\ is the number of edges in the network. (2) The stabilization time complexity of g-LME 
algorithm is in O (n) rounds, and (3) The space complexity of g-LME algorithm is in O (logD). 

Our solution for the £>-LME problem provide a good technique to reduce the service time and the 
fairness index of LME. The price to pay is the increase of the communications between processes. 



^-Group mutual exclusion. Let X be the set of resources. Let us assume that the preordering 
^ is defined on E -an arbitrary ordering, for instance a priority ordering. The binary relationship 
on A, defined by: x x y iff x ■< y and y ^ x, is an equivalence relationship. The equivalence classes 
are the groups. The set of groups is the quotient ^. p.v takes its values in |j. An other way is to 
say that ^ is an total ordering on the groups. To raise concurrency, if p asks a resource a and if the 
elected process at distance g requests a resource in the same group, p enters in critical section. The 
predicate cond is defined as: p/U = p.res.v. 

Note that we assume that there is no identity on the processes. However, we make the additional 
assumption that there is a total ordering on the resources. For instance, Local Mutual Exclusion 
problem is an instance of Group Mutual Exclusion where the resources are the processes. So, there 
is an ordering on the processes, which equivalent to define process identities. 



^-Readers- Writer. We assume that each process has an identity denoted by p.id. Each process 
has three possible requests: the process does not ask anything, the process asks to read, the process 
asks to write. This requests are symbolized respectively by N, R, W. In order to be able to compare 
two registers r at distance g, we assume that K > 4g + 1. For each £-ball B, the local ordering </ 
defines a total preordering on the registers r of processes in B. If a process p asks N or R then the 
register p.v is initialized by the value F. If p asks W then p.v is initialized by the value Wp.id) . 
The ordering ^ on S is defined by: 
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, dcf (v = F and v' = F) or (v = WId and v' = F) 
V ~ V = or (v = W Id and v' = W Id' with Id < Id') 

For each process p, the predicate cond is defined by matching on p.res as follows: 

(r, F) when r = p.r2 — > true 

| (r, VFid) when r = p.T2 and p requests N — > true 

| (r, Wzd) when r = p.r2 and (p.r2,p.id) = (r,id) — > true 

I __ — > false 



5 Conclusion 

We presented a self-stabilizing algorithm to solve the ^-wavelet scheme in arbritrary anonymous 
networks. Wavelets deals with coordination among processes which are at most g hops away of 
each other. The proposed algorithm works under any (even unfair) daemon. Using the wavelet 
scheme, we described a self-stabilizing layer clocks protocol and showed that it provides an efficient 
device in the design of local coordination problems at distance g, i.e., £>-barrier synchronization 
and £-local resource allocation (LRA) such as g- local mutual exclusion (LME), ^-group mutual 
exclusion (GME), and ^-Reader /Writers. Some solutions to the £>-LRA problem (e.g., g-LME) allow 
to transform algorithms written assuming any £>-central daemon into algorithms working with any 
distributed daemon. 
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A Self-stabilization of the layer clock 

We apply the convergence stair method [DolOO]. 

Lemma A.l The predicates WUi,WU2 and WU are closed. 

Proposition A. 2 The clock C\ stabilizes to WU\. 

Proof. Let e = 71 "fk— De a maximal execution. Assume that e is finite. Then, the last 

state 7; is a deadlock. So, the clock C\ is stabilized, otherwise there should exist a process for which 
C A\ or RAi is enable. We suppose now that e is not finite. The projection e\ of e on the registers 
77 is an execution of the clock C\. If ei is finite, then in the last state C\ is stabilized for the same 
reasons than above. If e\ is not finite, then e\ is an infinite execution of C\, so from [BPV04] there 
is a state which is in WU\. □ 

Proposition A. 3 The clock C2 stabilizes to WU2 ■ 

Proof. Let e = 71 Jk---- be a maximal execution. We can assume from Proposition A. 2 

that 71 £ WU\. So while C2 is not stabilized, C\ does not execute any action. So the projection 
e2 of e on the registers r2 is an execution of the clock C 2- If e2 is finite, then in the last state C2 
is stabilized otherwise there should exist a process for which CA 2 or RA 2 is enable. . If e2 is not 
finite, then e2 is an infinite execution of C2, so from [BPV04] there is a state which is in WUi- 

□ 

From Proposition A. 2 and Proposition A. 3 we deduce the corollary: 
Corollary A. 4 The layer clock stabilizes to WU. 

Proposition A. 5 (No starvation) Once stabilized, the clock Ci increments infinitely often. 

Proof. Let e = 71 7^.... be a maximal execution. We can suppose from Corollary A. 4 that 

71 £ WU. Assume that for a process p, action NA is executed only a finite number of time. Then 
the clock of each process is executed only a finite time, so e is finite. But in the last state of e, 
minimal processes according to the precedence relationship are enable, which is a contradiction. 

While C2 is not stabilized, C\ does not execute any action. So the projection e2 of e on the 
registers r2 is an execution of the clock C2. If e2 is finite, then in the last state C'2 is stabilized 
otherwise there should exist a process for which CA2 or RA2 is enable. If e2 is not finite, then e2 is 
an infinite execution of C2. From [BPV04] there is a state which is in WU2- □ 

B Correctness Proofs of Subsection 4.3.2 

£-LRA. In order to proof liveness and no lockout property, we lifts the main clock, using the lifting 
construction defined in section 3.2.2. We use the same notation, except that the register r becomes 
the register 77. As in Section 3.3, 5 = p + 1. 

Lemma B.l (No lockout) In each phase [Cus, Cus+g], there is at least one process which is elected 
and which increments. 
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Proof. The set of processes is finite. So there is an infimum (p.r2,p.v) among the processes, 
according to the total order relationship ^. p is elected during the phase [Cus, Cus+g] and p.ri 
increments. □ 



Lemma B.2 (Liveness) Every process has the privilege infinitely often. 

Proof. It is sufficient to prove that each process p has the privilege at least one time. Assume 

that a process p has never the privilege. Let Pot p = \p,q) ^ e * ne some °f the delays from p to 

qev 

each q in V. This quantity is an integer, and from the assumption that p has never the privilege, 
Potp is strictly increasing. So Pot p is not upper bounded. But this quantity is upper bounded by 
\V\ D. This is a contradiction. We deduce the lemma. □ 

£-LME. From Section 4.3, both the no lockout and liveness properties are verified. It remains to 
show the safety property. 

Lemma B.3 (Safety) If the process p has the privilege, then no process at distance less than or 
equal to g from p has the privilege simultaneously. 

Proof. Assume that p has the privilege in the phase [Cus, Cus+g], it enters critical section 
when its register satisfies p.r\ = US + g. Any other process q at distance less than or equal to g from 
p has not the privilege in the phase [Cus, Cus+g]- So if q £ B g (p) has the privilege simultaneously, 
it does not have the privilege in the same phase. The absolute value of the delay, according to 
the first clock, between the two processes is less than or equal to g. But when p enters critical 
section and while p is in critical section: q.r\ € {US + g — g, US + g + g} = {US, U g + 2g}, 
and US + 2g < (U + 1) S + g. We deduce that q can enter in critical section simultaneously if and 
only if q.r\ = US + g, thus in the same phase as p, which leads to a contradiction. □ 

^-Group mutual exclusion. Again, from Section 4.3, both the no lockout and liveness properties 
are verified. The next lemma directly follows from the construction of cond: 

Lemma B.4 (Safety) If the process p and the process q have the privilege simultaneously and are 
at distance less than or equal to g, the requested resources are in the same group. 
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